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Abstract 

Channel state information (CSI) in the interference channel can be used to precode, align, and 
reduce the dimension of interference at the receivers, to achieve the channel's maximum multiplexing 
gain, through what is known as interference alignment. Most interference alignment algorithms require 
knowledge of all the interfering channels to compute the alignment precoders. CSI, considered available 
at the receivers, can be shared with the transmitters via limited feedback. When alignment is done by 
coding over frequency extensions in a single antenna system, the required CSI lies on the Grassmannian 
manifold and its structure can be exploited in feedback. Unfortunately, the number of channels to be 
shared grows with the square of the number of users, creating too much overhead with conventional 
feedback methods. This paper proposes Grassmannian differential feedback to reduce feedback overhead 
by exploiting both the channel's temporal correlation and Grassmannian structure. The performance of 
the proposed algorithm is characterized both analytically and numerically as a function of channel length, 
mobility, and the number of feedback bits. The main conclusions are that the proposed feedback strategy 
allows interference aUgnment to perform well over a wide range of Doppler spreads, and to approach 
perfect CSI performance in slowly varying channels. Numerical results highlight the trade-off between 
the frequency of feedback and the accuracy of individual feedback updates. 
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I. Introduction 

Interference alignment (lA) is a precoding strategy that attempts to align interfering signals in time, 
frequency, or space. By aligning signals, lA reduces the dimension of the interference observed at each 
receiver and achieves the channel's maximum multiplexing gain. With alignment, network sum rate can in- 
crease linearly with the number of communicating users ID. Various methods of calculating lA precoders 
have been proposed lH-lOl. While Q-lOl are algorithmically different, they share the common feature 
that channel state information (CSI) for the interference channels is required to compute the lA solution. 
In network architectures envisioned for lA, the channels would be known at a centralized processor that 
computes the lA precoders and distributes the solution to the transmitters, or each transmitter would 
have joint knowledge of all the CSI to compute the solution independently. In either case, a reasonable 
approach for satisfying the CSI requirement is to feedback quantized CSI through limited feedback, of 
which a detailed overview can be found in lITOl and the references therein. A straightforward application 
of limited feedback, however, incurs significant overhead in lA networks. The reasons are that (i) feedback 
about every interfering link is required from every receiver and (ii) high resolution CSI is required to 
achieve alignment. As a result, low-overhead methods to transfer CSI must be designed to realize the 
data rates promised by lA. 

Several approaches have been proposed to reduce feedback overhead in lA networks ifTTl - lfTSl . In ifTTI . 
knowledge of channel gains is used to partition the network into optimally sized alignment sub-groups. 
In |[T2l . lA is applied to partially connected interference channels. Exploiting the network structure as in 
ifTTl . |[T2l limits the number of channels that must be fed back but does not provide better compression 
or higher quality CSI. To reduce the number of bits needed to feedback a channel response, [13] uses 
Grassmannian codebooks to quantize the channel coefficients in single-input single-output (SISO) systems 
with frequency extensions. The work in |[T4l extends that of |[T3l to systems with multiple antennas. 
Both |[T3l and |[T4l show that lA with limited feedback can achieve a system's degrees of freedom as 
long as feedback bits scale sufficiently with SNR. As the number of feedback bits increases, however, 
complexity increases and limited feedback becomes less practical as very large codebooks are undesirable. 
Analog feedback overcomes the problem of scaling complexity since the CSI distortion decreases naturally 
with SNR ifTSl . Analog feedback, however, is only optimal when the number of feedback channel uses 
matches the number of symbols being fed back |[T6l . Further, a main limitation of the feedback strategies 
in |[T3l - |[T5l is that they neglect the temporal correlation in the channel that can be exploited to further 
reduce overhead and improve CSI quality. 
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Differential and predictive feedback strategies have been proposed for compressing various forms of CSl 
in time-varying channels |[T7l -l l24l . Simplified differential strategies, such as IITtI . have been considered 
in commercial wireless standards to feedback channel quality indicators. In |[l9l differential feedback 
of multiple-input multiple-output (MIMO) channel correlation matrices is proposed by leveraging the 
geometry of positive definite Hermitian matrices. The strategy in |[T9l was shown to work well when 
implemented on a wireless testbed ll20l . In II2T1 the differential feedback of unitary precoders is considered 
in a single-user MIMO setting. The geometry of lA's required CSI, however, neither fits the structure 
considered in lITTl . |[T9l - |[2n . nor the parametrization derived in |[22ll . Thus, the feedback strategies in 
113, |[T9l - |[22l cannot be used. In lITSl . tree-structured beamforming codebooks are constructed allowing 
receivers to feedback differentially encoded information about a local neighborhood in the feedback 
tree. In |[23l differential feedback is applied to beamforming vectors that are gradually rotated on the 
Grassmann manifold using a Householder matrix to improve quantization accuracy. In ll24l Grassmannian 
beamforming vectors are fed back and reconstructed using predictive vector quantization tools, leveraging 
manifold concepts such as tangent vectors and geodesic curves. Unlike the single-user system considered 
in ll23l . however, lA systems are sensitive to leakage interference, and require more feedback than the 
broadcast channel ifTSl . |[24ll . As a result, the application of lITSl . Il23l . Il24l to lA networks is non-trivial 
and it is not clear whether they can achieve sufficient performance with practical codebook sizes. 

In this paper, we propose a limited feedback strategy based on Grassmannian differential feedback 
to fulfill the CSI requirement in wideband single antenna systems using lA over frequency extensions. 
Frequency extensions provide a practical source of dimensionality for lA precoding in SISO interference 
channels and play an integral role in achieving the maximum multiplexing gain in both single and multiple 
antenna systems lUl. Although we present results for single antenna systems, the feedback strategy can 
be generalized to systems with multiple antennas in a manner similar to |[T4l . The proposed strategy 
tracks the slow evolution of the normalized channel impulse responses on the Grassmannian manifold. 
At each feedback update, a quantized tangent vector relating consecutive channel realizations is used at 
the transmitter to reconstruct the channel. Since the tangent space geometry changes at each iteration, and 
varying Doppler spreads lead to different channel dynamics, we construct quantization codebooks that 
adapt to the channel's geometry and dynamics. Unlike ll24l . no prediction is performed and differential 
feedback is used in the new context of SISO interference channels. We show that the performance of the 
proposed differential strategy cannot be improved upon by a simple yet commonly used class of linear or 
geodesic predictors. Thus, the development of more sophisticated prediction algorithms that can further 
improve CSI quality remains a promising venue for future work. 
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We derive an approximation for the average distortion achieved by the proposed algorithm and show 
that it is accurate for a wide range of channel and feedback parameters. Unlike the analysis of the 
Householder strategy in |[23l and the predictive strategy in i24|. which neglect either the quantization 
process or the channel's Doppler spread, the derived performance characterization captures the effects of 
both fading speed and codebook size. Simulation results show that the proposed algorithm outperforms 
memoryless quantization and other competitive feedback strategies for temporally correlated channels 
ll24l . Through simulation, we highlight also the trade-off between feedback refresh rate and codebook 
size ||25l . We show that, from a CSI distortion perspective, frequent updates using smaller codebooks 
may often be preferred to less-frequent updates with a large codebook. Finally, while scaling codebook 
size is still required to preserve multiplexing gain, we show that lA with the proposed feedback strategy 
performs well in the range of medium to high SNR with a relatively small number of feedback bits. 

This paper is organized as follows. Section HIl introduces the SISO wideband interference channel model 
with orthogonal frequency division multiplexing (OFDM). Section |lll] reviews the concept of interference 
alignment over frequency extended channels and highlights the degradation due to imperfect channel 
knowledge. Section |IV] presents the proposed feedback framework while Section |V] elaborates on its 
main design parameters. Sections |Vl] and IVIII present analytical and numerical results respectively on 
the performance of the Grassmannian differential feedback, as well as the performance of lA when the 
proposed strategy is used for CSI feedback. We conclude with Section IVIIII 

We summarize the notation employed in this paper: A is a matrix; a is a vector; and a is a scalar; 
(•)* denotes the conjugate transpose; A o B is the Hadamard product; || Ajli;' is the Frobenius norm of 
A and trace(A) is its trace; ||a|| is the 2-norm of a; \a\ is the absolute value of a; I^v is the N x N 
identity matrix; Oat is the N-dimensional zero vector; is the A^-point discrete Fourier transform (DFT) 
matrix; diag(a) is the diagonal matrix obtained by putting the elements of a on its diagonal; [a, b] is a 
horizontal concatenation of vectors a and b, is the A^-dimensional complex space; C7V(a, A) is a 
complex Gaussian random vector with mean a and covariance matrix A. Expectation is denoted by E [•]. 

II. System Model 

Consider a frequency selective SISO interference channel with K communicating user pairs. Each user 
k communicates desired data to its paired receiver k and interferes with all other receivers I ^ k. The 
wideband channel between transmitter £ and receiver k at time t is given by the L-tap channel impulse re- 
sponse vector h.ki[t\ = [hk/[t, ^]* ,hk/[t, 1]*, • • • , hk/[t,L - 1]*]* \lk,l G {1, . . . , A'}. Channel impulse 
responses are drawn independently across k and I from a continuous distribution and have covariance 
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matrix E [hfc^^[t]hfc^£[t]*] = Rhi ,,i Vfc,£ € {1, . . . ,K} and Vt. Letting pk/ be the average attenuation of 
user ts signal at user fc's receiver, the channel response satisfies trace (Rh^f) = Pk/- In the case of 
uncorrelated channel taps, Rh^^ ^ = diag (pfc/), where pk.e is the L x 1 vector representing the channel's 
power delay profile. 

Using orthogonal frequency division multiplexing (OFDM), the transmitters transform the observed 
frequency selective channels into a set of N^c non-interfering narrowband subcarriers. Stacking each 
received OFDM symbol in a vector, the input-output relationship can be written in matrix form as 

yk[t] = Hfc,fc[t]xfc[t] + Y^Uk,i[t]Mt] + Vfc[t], (1) 

where Xfc[t] is the OFDM symbol sent by user k at time t with the average power constraint E [||xfc[t] |p] = 

represents the channel frequency 



NscP, the iVsc X A^sc matrix Uk/it] = diag [J^n,,^ [hfc/W^ ^K.-L 
response between transmitter £ and receiver k at time t, and V/t[t] is the i.i.d. CJ\f{0,a^lN^J thermal 
noise observed by user k. The system model assumes perfect time and frequency synchronization, and 
a cyclic prefix that is long enough to accommodate the impulse response of all channels as well as the 
potential propagation delay between users. 

Further, the relationship in ([T]) implicitly assumes that the L-tap channels hfc^[t] seen by the OFDM 
symbol remain constant over the OFDM symbol time and pilots can be used to estimate them at the 
receiver. The channels over consecutive OFDM symbols, however, are considered to be slowly varying. Let 
/d be the Doppler frequency of the channel and Tg be the OFDM symbol time. We assume channels are 



Pki 



temporally correlated according to the model proposed by Clarke II26I . i.e E 
Jq{2t: f^Tf^m) where Jq is the 0-th order Bessel function of the first kind. A main underlying assumption, 
however, is that while the channels vary slowly over time, their length L remains fixed. Since channel 
response length, is likely to vary slower than the channel response itself, and since it is constrained to 
be a small integer, the channel length can be fed back to the transmitter if necessary at a small extra 
overhead cost. In general, the proposed feedback algorithm can be applied to all correlated channels, 
however, further details on the mathematical structure of the temporally correlated channel are given 
whenever a result is model-dependent. 

Throughout the remainder of this paper we make the following two system-level assumptions: 
• We assume that the channels h^t ^[t] are known to receiver k. This implicitly assumes that the 
set of K transmitters coordinate (or orthogonalize) their pilot transmissions allowing receivers to 
estimate all incoming channels. We make a further simplification by assuming receiver CSI is perfect. 
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• We assume each receiver has an error-free logical feedback link to all transmitters. The feedback 
link from receiver k to transmitter £ may either be a direct link, or a link through other nodes such 
as transmitter k and/or a centralized processor. Effectively, this allows obtaining global CSI at at 
least one node to enable the calculation of lA precoders which, if needed, can be forwarded to the 
other transmitters. 

While the importance of transmitter channel knowledge makes the two assumptions rather standard 
in the literature on precoding for the interference channel lUl, 111, m-lH, ifTTl - lfTSl . or interference 
coordination in general |[27l . they are by no means trivial. Coordinating training phases, feedback phases, 
and centralized processing constitutes significant complexity in both cellular and ad-hoc systems. We 
refer the interested reader to ||271 and the references therein for a detailed discussion of the limitations, 
the benefits, and the possible solutions that enable coordination in the case of cellular systems. 

III. Interference Alignment in Frequency 

In this section we review the concept of lA over frequency extensions when perfect channel state 
information is available at the transmitter W then summarize the effect of imperfect transmitter channel 
knowledge on the performance of lA. 

A. lA with Perfect CSI at the Transmitter 

lA for the SISO interference channel can achieve the maximum degrees of freedom when coding over 
infinite channel extensions H]. Using lA over N frequency extensions, each transmitter k at time t sends 
dk < N symbols, along the x 1 precoding vectors f™[i]. As a result, the transmitted symbol is 

xfc[i] = x;frw^rw, (2) 

m=l 

where ||ffc'[t]||2 = 1 and E [|x^[t]p] = NP/d^, such that the power in each N subcarriers is NP 1281 . 
The transmit directions f ™ [t] are calculated such that the interference from K — 1 users is aligned at all 
receivers, leaving at least dk interference free dimensions for the desired signal. The number of transmitted 
symbols dk can be chosen according to the original strategy in HI or the improved method in Q, both 

of which asymptotically achieve the maximum degrees of freedom, i.e. limTv s>oo ji'^dk = K/2. 

Note that users need not code over all A'sc subcarriers since algorithm complexity increases significantly 
with the number of subcarriers whereas the achieved multiplexing gain may increase very slowly with 
Nsc as demonstrated by the LA solutions in HI. Users could potentially treat any A^-subset of subcarriers 
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as an independent coding group and thus have [-^J parallel alignment groups in an OFDM symbol. In 
this case Q applies to each group of subcarriers, and one can write [^J relationships for each OFDM 
symbol. For simplicity of exposition, in the remainder of this section we assume [^J = 1. 

To quantify concisely the degradation due to imperfect CSI, we consider a zero-forcing receiver; other 
receiver designs can be used. At the output of the linear receivers w™[t], the received signal is 



+ EE<M*Hfc,Mf/[t]4[t]+w^[i]*Vfc[t], 



(3) 



for m G {1, . . . , dk} and k G {I, . . . , K}, where ||w 
conditions for perfect interference alignment are 



1. Assuming a zero-forcing receiver, the 



w^[t]*H,,fc[t]ff[t]=0 
w^[t]*Hfc,,[t]f/[t] = 

\^ntriik,k[tmt]\ > 



V/c, £ m 

Mi / and Vm, 

V/c, m 



(4) 
(5) 
(6) 



where alignment is achieved by (HI and (HJ), while © ensures the decodability of the d^ desired streams. 
The sum rate achieved by the linear zero-forcing receiver |[29l . assuming Gaussian input signals, is 

K dk ^ / A''sc-P l,,,rrar+l*XJ, . [+1 P"i Ul 1 2 \ 



\^^^f[t]*Uk,km 



(7) 



k=l m=l 

where ^ [t] is self-interference from other transmit streams and X| ^ [t] is the interference from other 
users. The two interference terms are 



dk 



d. 



di 



(8) 



wr[t]*H,,,[t]f/[t] 



If perfect channel knowledge is available, and the number of symbols dk are feasible, equations (HJl-® 



can be satisfied with probability one and thus , 



0. This gives 



lim 



^ 1 , / ^|wrM*Hfc..Mfr 
E -wtM^ 1 + 



= lim 

P-!>oo log2 P P->cx) 



k.m 



K 



(9) 
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which confirms the fact that lA achieves the K/2 degrees of freedom of the K user interference channel. 
While the achievability proof in |fTl| assumed independent fading across subcarriers, |[T3l has claimed that 
fading on each subcarrier need not be independent provided that the channel impulse response is long 
enough. 



B. The Effect of Limited Feedback 

In practical systems, receivers cannot feedback the channels with infinite precision, thus only a 

distorted version of is available when calculating lA precoders. The distortion caused by limited 

feedback implies that interference will not be perfectly aligned. The power of residual misaligned 
interference increases along with transmit power and causes the SINR in the desired signal subspace 
to saturate, which limits sum rate at high SNR ||29l . As a result, low overhead feedback strategies must 
be designed to improve CSI accuracy and allow good lA performance. 

To establish the impact of limited feedback on lA sum rate, and to develop insights into the structure of 
a good quantizer, we summarize the calculations in |[T3l . |[T5l . In |[T5l . it is shown that if the transmitters 
use the estimated or quantized channels i^k,^, V/e, i £ {I, 2, . . . , k} to calculate lA precoders, then the 
mean loss in sum rate can be upper bounded by 



< 



1 



1 + 



jl I J2 



(10) 



Using the results from |[T3l Section IV-B], the components of the sum leakage interference, m' 
can be bounded by 

2N 



wr[t]*H,,[t]f/[t] 



< 



di 



lwrMof/Wfl|h.,MlP 



1 



kA[ 



Ih 



kA [ 



(11) 



where i^[t\ and w™[t] are the precoders and combiners calculated using imperfect CSI. The bound 
in (fTTl ) relates the power of leakage interference to the quality of the quantized CSI and suggests 
mathematical structure that can be exploited by the quantizer. The last term in ([TT] ) indicates that only 
knowledge of the normalized channels is necessary for lA, further the CSI is phase invariant since 



l|h. 



Il|h. 



This is not surprising since, if an interfering channel is scaled 
and rotated by a G C, the interference subspace remains unchanged, i.e., diag (^Ar^^ahfc = 
adiag {^N^Ja.k,i[t\) fi^M- As a result, the CSI needed for lA with frequency extensions evolves on the 



manifold of L-dimensional unit norm, rotationally invariant vectors otherwise known as the Grassmannian 
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manifold Ql^i BOl - BBl . Define tlie chordal distance between two points on the Grassmannian manifold, 



xi and X2, as (i(xi,X2) = y 1 — jx^X2| . The bound in ([TT]) can be rewritten as 



2 N P 

wrM*H,,,[t]f/[t] < -^\\^f[t]of^[t]f \\h,4t]f d(gfc,[^],g,,[^])^ (12) 

where gk.i[t] — \\hl'f [ t ] \\ ^^'^ Sk,i[t] — , showing that leakage interference can be bounded using 

the chordal distance between the true and quantized channels |[34l . 

IV. Grassmannian Differential Feedback 

In this section, we present the proposed feedback algorithm. Under some conditions, we show that 
differential quantization, a special case of predictive vector quantization, cannot be improved upon using 
a simple yet common class of predictors. 

A. Differential Feedback Framework 

We propose to differentially encode the normalized channel response vectors by exploiting both their 
Grassmannian structure and temporal correlation. The proposed algorithm encodes CSI increments using 
a tangent vector which defines the geodesic path between consecutive channel realizations. Our feedback 
framework uses tools presented in |[34l . which were simplified and first used for MISO broadcast channel 
feedback in |[24l . We consider the separate quantization of the slowly varying normalized channel vectors 
gfc,i[i] — Vt > 0; joint quantization may offer higher performance at the expense of additional 

complexity |[35l . Therefore, we restrict our attention to one of the channels and drop the user subscripts. 

The smooth structure of the manifold allows each two points to be related using the concept of a 
geodesic path as shown in Fig. [1] The geodesic curve describing the path between g[t — 1] and g[t] is 
given by 

G(g[t - l],e[t],£) = g[t - l]cos {\\e[tW) + ^.\n{\\e[t]\\l) . (13) 

where the vector e[t], also shown in Fig. [T] is known as the tangent vector. The tangent vector can be 
viewed as a length-preserving unwrapping of the geodesic path onto the tangent space at g[t — 1]. The 
tangent vector is given by 

.r<i , -if4t]\ slt]/p\t]-slt-l] 



where p[t] = g[t - l]*g[t] and d[t] = \J I - \p[t\\^ (Ml- From (O, it can be verified that G{g[t - 
l],g[t],0) =g[t-l], G(g[t-l],g[t],l) =g[t] and that ||G(g[t - l],g[t],£)|| = 1, since the tangent 
vector e[t] is orthogonal to the base vector g[t — 1]. 
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Previous channel (g[t-l]) 
Current channel (g[t]) 
Tangent vector (e[t]) 
Geodesic curve (G(g[t-l],e[t],l)) 
Tangent codebook 



Fig. 1. A 3D visualization of the Grassmann manifold and the tangent vector quantization process. The quantized tangent 
which is selected from the tangent codebook is also marked in red. 
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Fig. 2. Block diagram of Grassmannian differential quantization and feedback at the transmitter and receiver. 



The tangent vector and geodesic path, which together relate two points on the manifold, can be 
used to build a differential feedback framework to track the evolution of CSI |[24ll . Given the previous 
channel realization and a tangent vector received by feedback, the transmitter can reconstruct the current 
channel by applying (fT3l) . The operation of the Grassmannian differential feedback algorithm at both 
the transmitter and receiver is given in the block diagram of Fig. [2] For clarity, the pseudo code used 
to encode the channel evolution over time is given in Algorithm [T] At each new channel realization, 
the receiver estimates the normalized L-tap channel g[t]. Perfect channel estimation is assumed here to 
decouple the quantization error from the estimation error. Given the current observation, and the quantized 
channel in the previous iteration, g[t — 1], the receiver calculates the tangent vector, e[t\, from g[t — 1] 
to g[t\. The receiver then quantizes the tangent vector to produce a quantized tangent, e[t]. The details 
of the quantization method are given in depth in Section |Vl This quantized tangent is then fedback to 
the transmitter over a delay and error free feedback link. Equipped with the previous quantized channel, 
g[t — 1], and the quantized error vector e[t], both transmitter and receiver can now calculate the new 
quantized channel g[t]. This vector g[t\ will later serve as the base point in the next iteration. The 
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Algorithm 1 Receiver 

1: Input: g[t-l] 

2: for all t = 1,2, . . . do 

3: Estimate the channel g[t] 

4: Calculate tangent between g[t — 1] and g[t] using (fT4l ) 

5: Quantize and feedback the quantized tangent vector e[t\ 

6: Reconstruct the quantized channel g[t] = G{g[t — l\,e[t\, 1) 

7: end for 

Algorithm 2 Transmitter 

1: Input: g[t-l] 

2: for all t = 1,2, . . . do 

3: Receive the feedback tangent vector e[t\ 

4: Reconstruct the quantized channel g[t] = G{g[t — l\,e[t\, 1) 

5: end for 

6: Output: g[t] 



algorithm runs in parallel for all channels gk,i[t], VA;, i and the transmitters use the estimated CSl vectors 
gk,i[t], yk,£ to do interference alignment. 

B. Comparing Differential vs. Predictive Quantization 

The feedback framework of Section IIV-AI is a special case of predictive vector quantization, the main 
difference being that the proposed strategy does not attempt to further improve CSl by predicting future 
channel realizations. In this section, we begin by formalizing the main difference between predictive 
and differential feedback. We then show that, for a class of channels, commonly used Grassmannian 
predictors such as those in ||23]| . Il24l . yield no performance enhancement over the differential strategy 
presented. We note, however, that more sophisticated prediction functions may in fact further improve 
performance. Thus, the development of more intelligent Grasmannian predictors is a promising area for 
future work. 

1) Predictive Quantization: The main idea behind differential feedback is to send the transmitter 
information that allows it to move from the old quantized channel to the new channel realization. Thus, 
a tangent vector e[t\ is calculated between the base point g[t — 1] and g[t], and is then quantized to give 
e[t\. Instead of starting the tangent vector at the point g[t — 1], general predictive quantization calculates 
the tangent starting at another point g[t\, known as the predicted channel vector. The premise is that if 
g[t\ is "closer" to g[t\ than g[t — 1] (the base point used in differential feedback), quantization accuracy 
can be further improved. The main difficulty now lies in constructing prediction functions that actually 
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yield closer or better estimates of g[t]. 

Unfortunately, work on prediction on the Grassmann manifold is rather limited. In this paper, we 
consider a class of linear predictors used in |[23l . ll24l . Though the predictors in |[23l . ll24l are presented 
differently, the predicted channel vectors can be written in a unified manner as 



m = G{g[t - l],ep[t],£p) = g[t - 1] cos (||ep[t]||£p) + 



sin(||ep[t]||^p) 



(15) 



where ep[t\ is the predicted tangent vector and £p is a predicted step size. Since the norm of ep[t] can 
absorb any prediction step size £p, we enforce £p = I without loss of generality. 

The challenge now is to use commonly known information, such as g[t] VO < t' < t - 10 to find 
the best predicted tangent vector ep''*[t], i.e., the one that maximizes E |g[t]*g[t]p and makes g[t] the 
best estimate of g[t]. Thus, the prediction problem can be summarized as solving 



e°P*[t] =argmax E |g[t]*g[t]| 



(a) 



argmax E 

ep[i] 



(16) 



cos(||ep[t]||)g[t-l]+sin(||ep[t]||) 



h[t] 



where (a) is by replacing g[t] by its definition in ( fTSl ) and by noticing that maximizing E |g[t]*g[t]p 
is equivalent to maximizing E [|g[t]*h[t]p] . In what follows, we show that for a class of temporally 
correlated channels, such a geodesic predictor yields no performance enhancement over the differential 
feedback strategy presented. Therefore, to ensure that prediction does in fact improve performance, 
more sophisticated predictors that better exploit the channel's Doppler spectrum must be derived. The 
development of such predictors is a promising topic for future work. 

2) Performance Comparison & Prediction Gains: For the analysis in this section, we make the 
following assumptions on the channel's temporal structure. 

Assumption 1: We assume that the channels h[t] are temporally correlated according to a first order 
autoregressive model [36;], and are thus given by h[t] = ri{h[t — 1] + \/l — r]{'^z[t]. For simplicity, we 
further assume that h[t] ~ CM{0, ^1^) which in turn implies that z[t] ~ C^f{0, ^II)- 

While we present our analysis for channels satisfying Assumption [T] similar results can be shown 
for other temporally correlated channel models in which changes in the channel are zero-mean and 
independent of the previous channel realizations. 



'in fact 1231 , 1241 make the simplifying assumption that g[t ] VO < t < t — 1 is available during predictor design. We relax 
this assumption, by only allowing access to g[t ] VO < i < t — 1, at the cost of adding a milder assumption on the quantization 
error in g[t]. Results hold, and derivations are simplified, if we revert to the assumptions of 1231 . II24I . 
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Assumption 2: We assume that the quantization error introduced by our algorithm is isotropically 
distributed, i.e., the direction of g[t\ — g[t] is uniformly distributed, and that the error direction and 
magnitude are independent. 

For a channel following a first order autoregressive model, the predictor's objective function, E [[g[t]*h[t] p] , 
can be written as 

" / r 1 \ * 2" 

E[\g[trm\'] =iE 



cos(||ep[t]||)g[t - 1] +sin(||ep[t]||) (r/fh[t - 1] + V^^z^ 



E 



\rji {[g[t - 1], ep[t]] or h[t - 1] + mf3*^[t]f 



(17) 



where we define ^ = [cos (||ep[t]||) , sin (||ep[t] ||)]*, fj{ = ^1 - and x[t] = [g[t - 1]* , ep[t]*]* z[t] 
to simplify notation. Since z[t] CAf{0, j^^l) by Assumption [U and since the matrix [g[t — 1], ep[t]] 
is unitary and independent of z[t] by definition, the vector x[t] has distribution CM{0, j^l2) by the unitary 
invariance of the Gaussian distribution ll37l . Expanding the predictor's objective function, we get 



E [\g[trm\'] = E hf'h[t - 1]* [g[t - 1], ep[t]] (3P* [g[t - 1], ep[t]]* h[t - 1] + 



(a) 



2Re {vivMt - 1]* [S[t - 1], ep[t]] /3/3*x[t]} + fifx[t]*P(3*x[t]] 
E [7j,Mt - 1]* [g[t - 1], ep[t]] P(3* [g[t - 1], ep[t]]* h[t - I]] + r??! 



^=\f2cos2(||ep[i]||)E[|h[t-irg[t-l]|2 



+ rjf sin 



E 



|h[t-l]*ep[i]r 



+ E [2Re {r/f2 cos (||ep[t]||) sin (||ep[t] ||) h[t - l]*g[t - l]ep[i]*h[t - 1]}] + v^-, 

where (a) follows because x[t] is a zero-mean vector and independent of h[t — 1]* [g[t — 1], ep[t]] /3/3* 
and E [x[t]*/3/3*x[t]] = The expectation of the third term in (6) is zero as a result of Assumption |2] 
and the fact that ep[t] and g[t — 1] are instantaneously orthogonal. Moreover, leveraging the independence 
of ||h[t - 1]|| and h[t - l]/\\h[t - 1]||, and that E [||h[t - = 1, the objective function simplifies to 



E[\m*m\']=m'cos'{\\eM\)^\\s[t-imt-lf] +r?f2sin2(||ep[t]||)E[|g[t-l]*ep[t]r 



L 



Letting 'p = g[t — l]*g[t — 1], the channel g[t — 1] can be written as pg[t — 1] + yl — |ppe[t — 1] where 
e[t — 1] is a unit norm tangent vector representing the quantization error at time t — 1. By Assumption 
|2j e[t — 1] is uniformly distributed in the tangent space and independent of which results in 



E [\g[trm\'] = cos' (||ep[t]||) E [\pf] + rj^sm' {\\e,[t]\\) (1 " ^ [\p\'] )^ [|e[t - l]*ep[t]r 
m'cos' (||ep[t]||)E [IpP] +^^sm'{\\eM\) (1 - E + f?^ ^ , 



1 



L 
(18) 
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where (a) is since the isotropic distribution of e[t — 1] implies that the term \e[t — l]*ep[t]p is beta dis- 
tributed with mean E |e*ep[t] |^ = 1/(L — 1). Therefore the predictor's objective function is independent 
of the predicted direction of ep[t\. Further, E is practically such that E > (1 — E ), i.e. 



the quantizer's output is expected to be a more accurate estimate of the channel than a random vector. 



Thus E > (1 - E [|p|2])/(L - 1) and E [|g[t]*h[t] is maximized in ^ by letting ||ep[t]|| = 



As a result, the predictor's objective function is maximized by selecting ||ep[t]|| = and thus g[t] = 
g[t — 1]. This coincides with the differential feedback strategy presented, thus the performance of the 
proposed framework can not be further improved by such a predictor for the channel model considered. 

The derivation in this section, however, does not preclude potential benefits from more sophisticated 
prediction functions, especially when the band-limited nature of the channel's Doppler spectrum is 
exploited. While a rich body of literature on channel prediction does exist |[38l - H0l . work on predicting 
processes that evolve on the Grassmann manifold is limited to |[23l . Il24ll which use predictors and channel 
models similar to the ones considered in this section. The design of more sophisticated Grassmannian 
predictors that better exploit channel structure is left for future work. 



In this section, we discuss the proper initialization of the proposed algorithm and construct adaptive 
quantization codebooks for the tangent magnitude and direction. 

A. Initialization 

The proper operation of the Grassmannian differential feedback algorithm is ensured by the fact that, 
at each iteration, both transmitter and receiver can calculate a common quantized channel based on the 
quantized tangent vector and the previous quantized channel. For the output of the algorithm to be the 
same at the transmitter and receiver, however, a common initial vector, g[0], is required. Reinitialization 
can also be used to recover from feedback bit errors. This initial vector can, for example, be an estimate 
of g[0] that is fed back from receiver to transmitter in the first iteration along with e[0]. Alternatively, 
it can be based on a memoryless quantization of the channel using methods such as |[30l . |[311 . The 
additional overhead incurred is amortized when there are many subsequent feedback stages. Another 
option is to initialize the estimate g[0] to a common random vector from which the algorithm will be 
able to recover. In this paper, we use memoryless quantization with a random vector codebook at t = 1. 



which achieves 




(19) 



V. Design Considerations & Codebook Construction 
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B. Tangent Magnitude Quantization 

The tangent vector calculated in ([T4l ) can be decomposed into a tangent magnitude and a unit norm 
tangent direction. We propose to quantize the tangent magnitude and direction separately, which ensures 
that search complexity remains manageable. The problem of quantizing the tangent magnitude is that 
of quantizing a non-negative scalar. We propose to quantize the tangent magnitude using a Euclidean 
distance distortion metric 



where Cmag is the magnitude quantization codebook. The index of the minimizer is then sent to the 
transmitter via a delay and error-free link that requires A^mag = log(|Cmag|) bits. A locally optimal 
quantization codebook can be found algorithmically using Lloyd's algorithm BTI given either the prob- 
ability density function (pdf) of the tangent magnitude, or a training set of magnitudes which provide 
an empirical probability mass function. Finding the exact pdf of the tangent magnitude, however, has 
been intractable analytically thus far. Generating a good representative training set as input to the Lloyd 
algorithm is also difficult. The difficulty in both these approaches is that the magnitude pdf depends 
on the quantization process itself. That is to say that quantizing the tangent direction and magnitude 
itself could change their distribution in the next step. What is needed is therefore finding a pdf that is a 
fixed point of the quantization process, which has proven to be difficult. One solution which is adopted 
in 1241 is to uniformly quantize a range of magnitudes, ||e[t]|| G [0,1]. Unfortunately, 1241 has shown 
that quantization error in the magnitude creates an error floor beyond which CSI can not be improved 
in highly correlated channels where predictive feedback is most useful. In the limit of perfectly static 
channels, such magnitude quantization prevents feedback from converging to perfect CSI. 

We propose to adapt the quantization range, [aemiiji], /3emax[i]]> to the dynamics of the system and 
to uniformly quantize the magnitude in that range. To do so, the algorithm maintains a sliding window of 
the quantization range of interest. This adaptive process maintains a running average of the tangent vector 
magnitude and uses it to update the lower and upper limits of the quantization range. The quantization 
range, or window, is calculated using quantities that are commonly available at both transmitter and 
receiver. As a result, both nodes can independently keep track of the current quantization range. 

The moving average of quantized tangent magnitudes is given by: 



where r is the smoothing factor that can be adjusted by the system. At each iteration either limit of the 




(20) 



eiSC, 



Lag 




(21) 
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quantization range is then updated as follows: 

6mag [t] ^ Cavg [t] 

f-mag [t] ^ Cavg [^] 
Cmag [t] ^ Cavg [^] • 

This allows the feedback algorithm to accurately track the statistics of the tangent magnitude and quantize 
the current range of interest with higher resolution. In static channels, this allows the system to converge 
to perfect CSI removing the error floor caused by constant magnitude codebooks. 

The fixed scalars a and /3 must now be such that a < 1 < /3 to enable the flexible adjustment of 
the window [emin[i], CmaxM]- To see this, note that to let eminM decrease, it is necessary to be able 
to quantize values less than emin[i]- Similarly, allowing emax[i] to increase requires quantizing values 
greater than it. In simulation we use r = 5, a = l/2 and /3 = 2. While the chosen parameter values 
yield good results as shown in Section IVIIl system performance can be further improved by optimizing 
over the values of r, a, and /?. 

C. Tangent Direction Quantization 

The problem of quantizing the tangent direction vector is that of quantizing a unit norm vector which 
lies in the tangent space orthogonal to the base vector g[t — 1], that is e[t]*g[t — 1] = 0. Grassmannian 
codebooks often used for limited feedback l[30l . l[3ll are not suitable for tangent vector quantization 
since they do not enforce the structural constraint that requires the tangent direction codewords to be 
orthogonal to the base vector g[t — 1]. If the quantized tangent vector is not orthogonal to the base vector 
g[f — 1] , the geodesic path is undefined and the output of G(g[t — 1] , e[t] , £) does not lie on the manifold. 
Finally, note that the tangent space changes for each base vector g[t — 1] and thus the codebook must 
be adapted ll42l . i.e., fixed codebooks should not be used. 

To respect the varying tangent space geometry and orthogonality constraints, we propose a canonical 
generating codebook that is adapted at each iteration; only the canonical codebook is actually stored. This 
canonical codebook quantizes the tangent space at a special L x 1 base vector. The canonical codebook 
is rotated at each channel realization to match the geometry of the current tangent space. 

Definition 3: The A'^tiij-bit canonical tangent codebook is a codebook of 2^'"' unit norm vectors 
orthogonal to the special base vector xi, := [1,0, ... ,0]. Because of the structure of Xh, the generator 
codebook, Cdir(x;,), has elements, V£ € Cdir(xf,), of the form = [0, v^]* where is an (L — 1) x 1 



emin[i + 1] 
Cmax[^ ~l~ 1] 



Cmax [t\ 
(1 - 7) emaxM + ^emagM 
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unit norm vector. 

The canonical codebook can be constructed by appending a leading to a codebook of (L — 1) x 1 
vectors. The canonical codebook, which can be used to quantize the tangent space at x;,, can now be 
rotated at each iteration (or channel realization) to quantize the tangent space at a new vector x. We 
define this rotation as follows |[23l . ll42i . 

Definition 4: Let U(x) : C^^^ i-^ U^^^ be the function that determines the unitary rotation matrix 
that rotates x;, to x, i.e. U(x)x5 = x. 

Perhaps the easiest way to find such a rotation is to consider the Householder matrix. Let H(x, Xf,) = 
II — uu*/u*Xb where u = Xf, — x H3l . Note that H(x, x;,) is a unitary matrix and that if we let 
U(x) = H(x, Xfe) then U(x)xf, = x as required. Using the rotation operation it is possible to rotate the 
canonical codebook to the direction of any particular base point and generate tangent direction codebooks 
for an arbitrary channel. For a base point, g[t — 1], the quantizer chooses edir[t] from the rotated codebook 

edirW e Cdir(g[t - 1]) = {U(g[t - l])vi, U(g[t - 1])V2, . . . , U(g[t - l])v2«.,} . 

It can be verified that all vectors U(g[t — l])v£ lie in the tangent plane at %[t — 1] since g[t — l]*U(g[i — 
l])v, = x*v, = 0, V^. 

The proposed codebook design allows us to rotate a canonical codebook to perfectly match the tangent 
plane at each iteration. This ensures that the output of the Grassmannian differential feedback algorithm 
remains on the manifold. Moreover, if changes in the channel are isotropic in the L dimensional space, 
then so are the normalized tangent vectors, implying good performance with canonical codebooks that uni- 
formly quantize the tangent space at x;,. In Section IVlIl good performance is achieved even with random 
vector canonical codebooks. In general, for a given correlation matrix Rh^^ j., good canonical codebooks 
can be generated using Lloyd's algorithm, and different codebooks can be stored for environments with 
different correlation properties. 

To formalize the tangent direction quantization, recall that the estimated channel in the next iteration is 
G(g[t — 1], e[t], 1). Given that the loss in sum rate is related to the chordal distance between the channel 
and its estimate, the quantized tangent direction is computed as 

edirMM= argmin d(G(g[i - 1], e,„agMxj, 1), g[t]) = argmax |G(g[t - 1], CmagMx^, l)*g[i] (22) 

x.eCdi,(g[t-i]) xieCdir(g[t-i]) 

where the tangent magnitude, emag[i]> is given by the output of the magnitude quantization step. We note 
that although (l22l ) intuitively includes a rotation of the full canonical codebook, computational complexity 
can be reduced by noticing that an equivalent rotation can be applied to the channels g[t — 1] and g[t] 
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instead, and the codebook need not be actually rotated. The details of this simplification are omitted but 
can be seen by expanding the objective function d{G{g[t — l],emagMxj, 1). 

VI. Analytical Performance Characterization 

A complete performance analysis of differential or predictive quantization is difficult due to its recursive 
nature and, in our case, due to the CSFs Grassmannian structure ETl . Relevant performance analysis when 
the quantized information evolves on the Grassmannian manifold is limited to incomplete performance 
characterizations ll23l . ll24l . The analysis of the Householder scheme in ||23| is limited to characterizing the 
performance of an open loop predictor, similar to Section lTV-BI The resulting lower bound on performance 
in |[23l neglects the quantization process and is not a function of the number of feedback bits. The 
performance analysis in |[24l presents partial distortion bounds that are independent of the channel's 
temporal correlation and Doppler spread. 

In this section we derive an approximation for the distortion achieved by the proposed quantization 
strategy. Unlike the analysis in ll23l . |[24l . our characterization accounts for both the number of feedback 
bits and the channel's temporal correlation, thus capturing most of the fundamental system parameters. 
Given the distortion expression, we leverage our analysis in 1131 and the discussion in Section IIII-BI to 
characterize lA's mean loss in sum rate when the proposed feedback strategy is used. 

We derive an approximation for the quantizer accuracy defined as E [|g[t]*g[t] p] , under three mild 
assumptions. First, we specialize our analysis to the case of channels with relatively low Doppler spread 
where differential feedback techniques yield the most gain |[2ll . |[23l . In the low Doppler setting, we 
can make use of a small angle approximation |[24l . |[34l . Second, we neglect the error introduced by the 
tangent magnitude quantizer of Section |Vl Effectively we make the following assumption. 

Assumption 5: We assume perfect magnitude quantization, i.e. CmagM = Vt. 
This is a reasonable assumption for our analysis as the numerical results in Section IVII-AI indicate that 
the magnitude quantizer adopted achieves very low quantization error with as little as one quantization 
bit and approaches perfect quantization at all Doppler. Finally, the analysis re-invokes Assumption [T] for 
simplicity of exposition. 

We start by expanding P(r7f,7Vdir) = E [|g[t]*g[t] P] as 

21 



E 



cos(||e[t]||)g[t - 1] + sin(||e[t]||)^^ ) (cos(e„,agM)i[t - 1] + sin(e„,agW)edir[t][t]) 



^ 2 

?mag[t])+sin(||eM||)sin(e„,aJt]) ^[^]*;^;;-p[^] 
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By invoking Assumption |5l I'(r/f , A'dir) simplifies to 



V{rii,Ndi,] 



E 



2 cos^ 



)sin2(||e[t]||)Rc 





+ sin4(||e[t]||) 




2' 


I lleWII J 


lleWI! 





(23) 



Recall that at each new channel realization the quantized tangent direction ejir [t] is selected to minimize 
the chordal distance between g[t] and g[t], equivalently maximizing |g[t]*g[i]p. By invoking Assumption 
[51 maximizing |g[t]*g[t] ^ becomes equivalent to maximizing the instantaneous value of the right hand side 
of (l23l) . The distortion function in (l23l) . however, is new; it is neither a Euclidean quantizer's distortion 
function nor is it a Grassmannian quantizer's phase-invariant distortion function. As a result, neither 
Euclidean nor Grassmannian quantization results can be used to find an expression for 'D{r]{, A'^dir)- 

By analyzing a similar alternative quantization strategy, however, it is possible to gain insight into the 
behavior of ( [23] ) and derive an approximation for P(r/f , N^ij-). Instead of directly quantizing the tangent 
direction to maximize the instantaneous value of ([23l) . we propose to split edir[i] into two components, 
GdirM = e~''^''Weg[t]. The variables 9e[t] and eg[t] are then quantized separately using Nq and Ng bits. 
To keep the amount of feedback constant, we require Nq + Ng = N^ij.. Such separation, naturally incurs 
an increase in distortion when compared to our original "joint" quantization strategy. Therefore, after 
deriving a distortion expression for the separate quantization strategy, we pick Nq and Ng to optimize its 
performance which leads to a more accurate characterization of the original "joint" quantization strategy. 

Inserting the proposed tangent decomposition into ([23]) yields 

^ e~^^'^'^e[t]*eg[t]^ 



V{7]i,Ndn)=^ 



cos^(||e[t]||) + 2cos^(||e[t]||)sin^(||e[t]||)Re. 



+ 



sm 



e-iUt]e[t]*eg[t] 



(24) 



E 



cos^(||e[t]||) + 2cos2(||e[t]||)sin2(||e[t]||) 



e[t]*eg[t] 



cos[Ze[t]*eg[t] - Oeit] 



e[t]*eg[t] 



sin^dleMII) 



(25) 



where Ze[t]*eg[t] is the phase of the inner product e[t]*eg[t] and as a result Re|e^-''^=We[t]*eg[t]| = 
Re |eJ(^eW*SsW-eeM) |e[t]*eg[t]|| = \e[t]*eg[t]\ cos (^Ze[t]*eg[t] - ^e[t]y Due to the decomposition of 
edir[t] the distortion function decouples, i.e. the quantization of edir[t] decouples into a scalar quantization 
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step to minimize |Ze[i]*eg[t] — 9e[t\\ and a Grassmannian quantization step to maximize |e[t]*eg[t]|. 
Thanks to the decomposition, we can now use known results about scalar quantizers and Grassmannian 
quantizers to analyze the performance of our decomposed strategy. 

Recalling Assumption [T] and specializing the analysis to channels with relatively low Doppler as stated 



earlier, we apply a small angle approximation which can be used to see that e[t] « yr^z[t] El, 
||34l . Therefore by Assumption [T] and the small angle approximation, one can see that the tangent direction 
e[t]/||e[t]|| is isotropic and independent of ||e[t]|| and that sin^(||e[t] ||) = ||e[t]p = 



Using this fact, along with the approximation E [x^] E [x]^, which is accurate when x has relatively 



low variance, gives 

Hm, Ndn) - + 2%'(1 - %')E cos ( Ze[t]*eg[t] - O^lt] 



E 



e[t]*eg[t] 



+ 



(26) 



Now, using an NQ-hit uniform codebook over [0,27r] for Of,[t\ yields 

sin(7r/2^'') 



E 



cos [Ae[t]*eg[t] - Belt] 



7r/2^« 



(27) 



since the e[t] is isotropic which implies that Ze[t]*eg[t] is uniform ll30l . Further, using an L — 1 
dimensional Grassmannian codebook to construct the canonical codebook for eg [t] yields 



e[t\*eg[t\ 



< 1 



(28) 



2iVs/(L-2) • 

The proof of ( [28] ) follows from the derivations in llBll and is thus omitted. Inserting ( [271) and ( |28l ) into 
(l26l) yields the final result which we summarize in the following proposition. 

Proposition 6: Under Assumptions [T] and [51 the accuracy achieved by the proposed Grassmannian 
differential feedback strategy with a 2^="' direction codebook is 

2, sin(7r/2^'') 



^^(m , ^dir) ~ max 7]i + 2r/f (1 - r^' 

Ng, Ng 

s.t. Ne + Ng = iVdi, 



7r/2^« 



1 



+ (l-r?fY(l 



2Ng/{L^2) 



2Ng/{L~2) 



The values of Nq and Ng can be easily determined by a search over A'dir possibilities. 

Finally, we note that the approximation in Proposition [6l can now shed light on the performance of lA 
with the proposed feedback strategy. For example, it can be used in an identical manner as in |[T5l to 
evaluate the mean loss in lA sum rate. Since deriving lA's mean loss sum rate parallels our derivation 
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in |[T5l . and since the main results of |[T3l . |[T5l have been summarized via (ITOl ) and (fTTI ). we present the 
final result on sum rate loss and refer the reader to fTSl for more detail. When the differential feedback 
strategy, with A'dir bits for the tangent direction, is used to fulfill lA's CSI requirement in a channel with 
Vi = JqC^'^IdTs), the mean loss in sum rate is such that 

A;?_ < ^ log, f 1 + ^ E ''''^''-^'''^ (1 - Vi,„ N,,,))] , (29) 

where Se^k is the Kronecker delta function. As a result of (l29l ). lA sum rate with differential feedback 
is such that -Rsum ^ ^sum' ~ ^Rsum, whcrc i?sum^ the sum rate achieved with perfect CSI. We note, 
however, that the characterization given in ( [291 ) and derived using the methods flSl is known to be rather 
loose especially when CSI quality is poor, mainly due to the use of Jensen's inequality. Tighter, yet more 
involved, sum rate characterizations can be derived using alternative methods as in B4l . 

VII. Simulation Results 

In this section we present numerical performance results. First, we characterize the performance of 
Grassmannian differential feedback in relation to Doppler and codebook size, and explore the trade-off 
between codebook size and refresh rate. Second, we demonstrate the performance of lA when CSI is 
obtained via the Grassmannian differential feedback strategy. 

A. Quality of CSI 

First, we examine the quantization error resulting from using the proposed algorithm to quantize the 
channel's impulse response. We consider L tap channels which are temporally correlated according to 
the M-order autoregressive channel model defined in |[36l . Unless otherwise stated, our simulations use 
M = 200 and initialize the time series as in [i36il . Although the analysis of Section ITVl specialized M = 1, 
adopting a more complex higher order model in simulation provides more realistic performance results. 
We restrict our attention to an uncorrected uniform fading profile, that is Rh = -^I^.The normahzed 
channel vectors, constitute the correlated Grassmannian time series to be quantized. 

Fig. [3] shows the chordal distance between the actual and quantized channel for a three tap correlated 
channel with /bTs = 0.005. Comparing Grassmannian differential feedback to memoryless random 
vector quantization, we see that the proposed algorithm achieves a much lower chordal distance as early 
as the third or fourth iteration. The initial transient phase in which the quantization error is higher than 
the steady state is due to initializing the algorithm with an all ones vector, i.e. g[0] = -^[1,1,1]*. 
The quick convergence to the steady state performance indicates that there is limited loss due to random 
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Fig. 3. Chordal distance, d(g[t], g[t]), plotted over time for a slowly channel with normalized Doppler /dTs = 0.003. This 
shows the high quality CSI achieved by the proposed algorithm with 6 and 2 bits for channel direction and magnitude respectively, 
compared to |24J and memory less quantization with a random codebook of 8 bits. 




Fig. 4. This shows the average chordal distance, or quantization error, versus the number of bits allocated for the tangent 
magnitude (1 to 10 bits), while keeping direction bits fixed. This shows the good performance of the proposed algorithm when 
compared with 1241 . 



initialization. The proposed algorithm also achieves a much lower steady state error compared to the GPC 
algorithm presented in |[24l due to the adaptive codebooks presented. Examining the average quantization 
error achieved by both the proposed algorithm and the GPC algoiithm of ll24l . we see that the proposed 
algorithm consistently outperforms. This can be seen in Figs. |4] and |5] 
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Fig. 5. This shows the average chordal distance, or quantization error, versus the number of bits allocated for the tangent 
direction, while keeping magnitude bits fixed. This shows the good performance of the proposed algorithm when compared with 
(24). 



Fig. Hlplots the average error of both algorithms vs. Doppler spread for a varying number of magnitude 
quantization bits, Ni-ng,g. The adaptive magnitude codebook, with even 1 bit of feedback, achieves 1.5dB 
lower distortion than the GPC algorithm with 10 bits of magnitude feedback. Adapting the codebook as 
presented in Section |V] allows accurate quantization and leaves very little to be gained from increasing 
the number of magnitude bits. Therefore, while ll24l requires an increasing number of magnitude bits at 
low Doppler, which is the case of interest, our proposed algorithm works well with 1 bit for all Doppler 
spread. Moreover, if the total number of feedback bits is fixed, and the bits are allocated optimally across 
the direction and magnitude codebooks, it can be numerically shown that the optimal number of bits 
allocated to the magnitude is always 1. 

Fig. |5] plots the average error for a varying number of direction quantization bits, A'dir, while keeping 
-^mag constant. Fig. |5] again shows that the proposed algorithm consistently outperforms existing feedback 
schemes. We can see that the proposed algorithm can achieve up to 13dB lower quantization error than 
previous schemes. This is due to the proposed magnitude and tangent codebooks which adapt to the 
geometry of each point on the manifold. Moreover, Fig. |5] shows that, for the proposed algorithm, 
increasing the codebook size from 3 to 4, or 4 to 5 bits offers the greatest improvements in quantization 
error, after which only marginal improvements are experienced. Interestingly, as opposed to further 
increases in codebook size, going from 3 to 5 bits does not introduce a constant improvement at all 
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TABLE I 
Minimum Direction Bits 



Length of Channel 


Suggested Direction Bits 


2 


3 


3 


5 


4 


7 


5 


9 


6 


11 



Doppler and instead changes the slope of the curve providing much larger gains in slow fading channels. 

Investigating this phase transition behavior further we notice that an error floor exists at low Doppler 
for all tangent direction codebooks smaller than 5 bits. Moreover, this phase transition is independent of 
the number of bits allocated to the magnitude, i.e. quantizing the magnitude with infinite precision does 
not change this behavior. Examining channels with more taps, we notice that this behavior is recurring, 
and that the phase transition occurs according to Table Jl It is interesting to note that this phase transition 
behavior for an L-tap channel always occurs at 2L — 1 bits, which is the number of free variables in the 
tangent direction. A formal investigation of this is left for future work. 

Fig. [6] plots E [|g[t]*g[t]P] vs. /dTs for a practical range of A'^dir to evaluate the accuracy of the 
performance characterization derived in Section |Vll Similarly to Section |Vll we consider the performance 
of the differential algorithm for a first order auto regressive channel model. As expected, the performance 
characterization is most accurate for slow fading channels and is reasonably accurate in faster fading 
scenarios. The reason for this is that the small angle approximation used is naturally suited for relatively 
low Doppler spreads. Moreover, the performance characterization gradually becomes more accurate as 
A^dir increases. Recall that the result was derived using a "separate" quantization scheme in which the 
tangent direction was decomposed into a phase angle, and a phase invariant direction. The bit allocation 
between the 2 variables, Nq and Ng, was then optimized to better characterize the superior "joint 
quantization" approach. The penalty due to such separate quantization is large when a small number of 
bits is available as the "product" of the two codebooks does not efficiently fill the tangent space. As the 
number of bits increases, the penalty due to such separate quantization diminishes and the performance 
characterization becomes more accurate even at high Doppler. Such behavior is in line with previous 
results on separate vs. joint quantization BTI . 

Finally, we highlight an issue that has often been overlooked in CSI feedback, with a few exceptions 
such as the analysis in |[25l . fl31 for single-user systems. Namely, the numerical analysis thus far has 
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Fig. 6. This figure compares the simulated performance of differential feedback with the performance characterization derived in 
Section lVll While the characterization is loose when feedback bits are low, due to the significant loss from separate quantization, 
the characterization becomes increasingly accurate as resolution increases. 

assumed a fixed normalized Doppler spread fuTs, and a fixed number of feedback bits. Both these 
quantities, however, can be controlled by the system while keeping the overall overhead of feedback, i.e. 
feedback bit rate, fixed. When such optimization is possible, feedback frequency and codebook size can 
be traded-off depending on channel conditions, resulting in lower distortion. Fig. |7] shows that increasing 
codebook size while keeping overall overhead constant can significantly deteriorate performance in fast 
fading channels. On the one hand, slow fading channels can support a large delay between high resolution 
feedback updates, since the channel does not change appreciably. On the other hand, decreasing the 
feedback time Tg to counter the high Doppler frequency /d yields better performance even if each 
update carries fewer bits. A formal analysis of this effect in the context of Grassmannian differential 
feedback is left for future work. 

B. lA Performance Analysis 

In this section we present simulation results to demonstrate the performance of lA when channel 
knowledge at the transmitter is obtained via the Grassmannian differential feedback strategy detailed 
in Section |lVl We assume that the channels are correlated according to the same autoregressive model 
used in Section I VII- A [ and that all the channels have equal Doppler spreads. Moreover, we assume that 
Pk,i = Pi Vfe, Thus we define the signal-to-noise ratio as SNR = Since the frequency extended 
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Fig. 7. Keeping the total feedback rate fixed at 5000 bits/s, this figure shows the average chordal distance or quantization error, 
versus the number of feedback bits. This shows that it is most often optimal to send frequent low resolution feedback. While 
slow fading channels can support and benefit from large feedback update periods, the performance in faster fading channels 
significantly deteriorates if refresh rate is sacrificed for codebook size. In all cases iVmag ~ 1 and A'^dir is varied. 



system can be viewed as a virtual MIMO system, the sum rate achieved is calculated as H6l . BTl . 

-1 



^ 1 

Rsnra = ^ log2 
k=l 



and the precoders, 



1 + 



N^rP 



di 



N^rP 



k,m 



mf^k 



, (30) 



f 1 f2 'fdk 
^fc ' ^fc ' • • • ' ^A; 



, are calculated given ideal or estimated CSI. We note here that 
the sum rate calculated in (1301 ) assumes that receivers use the perfect CSI available to them for decoding. 
For the results in this section, we use a modified version of the interference alignment algorithm proposed 
in lEl, with the number of data symbols given by the conditions in |[T1. Although a closed form solution 
for the lA precoders exists for the single antenna frequency extended interference channel in m, it has 
been shown that the solution in H] yields low sum rates if not further improved as in [48]. 

Fig. [8] shows the sum rate achieved by 3 users using lA over 16 channel extensions. CSI is obtained 
by our proposed algorithm, the strategy in |[24]| . and memory less vector quantization. For all the feedback 
strategies shown, each 3-tap channel is quantized using 10 bits. For our feedback strategy and that of 
ll24ll . 7 and 3 bits are used for the tangent direction and magnitude respectively. From Fig. [H one can 
show that, in the case of perfect CSI, the rate of increase of sum rate with SNR is 1.35, which approaches 
the 1.46 degrees of freedom predicted in theory over 16 subcarriers. 
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Fig. 8. This figure shows the performance of lA with imperfect CSI obtained through the proposed algorithm, the algorithm 
in 1241 . as well as random vector quantization. This shows that for slowly varying channels, the proposed algorithm allows 
interference alignment networks to come very close to the perfect CSI upper bound. 

As for the performance of lA with Grassmannian differential feedback, Fig. [8] shows that the sum rate 
achieved by lA is significantly improved by exploiting temporal correlation in the channel. Although 
multiplexing gain will likely never be preserved with a constant number of feedback bits, we see that 
for correlated channels, the proposed algorithm exhibits close to perfect performance over a wide range 
of SNR. Practically, this is in fact what is needed. Since a system will almost never function in the 
asymptotically high SNR regime, the goal in practical systems is to optimize for the medium to high SNR 
regime where interference alignment is likely to be used. The proposed algorithm succeeds in providing 
negligible sum rate loss at up to 30dB, in channels with a normalized Doppler of up to 10^^. In fact this 
can be achieved with much less feedback bits as opposed to the 10 bit feedback performance shown in Fig. 
[8] The proposed algorithm continues to outperform memoryless quantization even at a significant Doppler 
of JbTs = 0.05 regardless of whether random codebooks (RVQ) or optimized codebooks, generated 
using the well-known Lloyd algorithm BTI . 1491 . are used. The proposed algorithm also consistently 
outperforms the GPC algorithm in Il24l as a result of the adaptive magnitude and direction codebooks 
used. Finally, Fig. [8] shows the analytical sum rate predicted by (|29l ) for the case of /bTs = 0.003 and 
0.01. As stated in Section |VlJ the rate characterization is seen to be rather accurate for high SNRs up 
to 30dB, when 10 bits provide CSI of sufficient accuracy, but becomes progressively looser as SNR 
increases further and the CSI requirement becomes more stringent. 
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VIII. Conclusion 

Limited feedback is a practical way to meet the CSI requirements of lA. To provide high resolution, we 
proposed a feedback strategy based on Grassmannian differential coding that uses the channels temporal 
correlation to track channel responses by moving along geodesic paths defined via quantized tangent 
feedback. We showed that lA in conjunction with the proposed algorithm can benefit from more accurate 
channel knowledge and provide better sum rate performance for a large range of SNR conditions and 
a limited number of feedback bits. Moreover, our algorithm exhibits significant improvements over 
earlier methods of feeding back correlated time series on the manifold due to the optimized tangent 
and magnitude codebooks. 
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